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Abstract. Let A be a subset of a finite field F := Z/gZ for some prime q. 
If \F\ S < \A\ < |F| 1_<5 for some <5 > 0, then we prove the estimate \A + A\ + 
\A ■ A\ > c((5)| J 4| 1 + £ for some e = e(5) > 0. This is a finite field analogue of 
a result of [ESzl983]. We then use this estimate to prove a Szemeredi-Trotter 
type theorem in finite fields, and obtain a new estimate for the Erdos distance 
problem in finite fields, as well as the three-dimensional Kakeya problem in 
finite fields. 



1. Introduction 

Let A be a non-empty subset of a finite field F. We consider the sum set 

A + A := {a + b : a, b e A} 

and the product set 

A ■ A := {a ■ b : a, b E A}. 
Let \A\ denote the cardinality of A. Clearly we have the bounds 

\A + A\,\A-A\ > \A\. 

These bounds are clearly sharp when A is a sub field of F; however when A is 
not a subfield (or an affine transformation of a subfield) then we expect some 
improvement. In particular, when F is the cyclic field F :— Z/qZ for some prime 
q, then F has no proper subfields, and one expects some gain when 1 <C \ A\ <C |.F|. 
The first main result of this paper is to show that this is indeed the case: 

Theorem 1.1 (Sum-product estimate). Let F := 'Ljq'L for some prime q, and let 
A be a subset of F such that 

\F\ S < \A\ < IF] 1 - 5 
for some S > 0. Then one has a bound of the form 

max(\A + A\,\A-A\) > c(6)\A\ 1+s (i) 

for some e = e($) > 0. 



We note that one needs both \A + A\ and \A ■ A\ on the left-hand side to obtain an 
estimate of this type; for the additive term this can be seen by considering an 

arithmetic progression such as A := {1, . . . , N}, and for the multiplicative term \A- 
A\ this can be seen by considering a geometric progression. Thus the above estimate 
can be viewed as a statement that a set cannot behave like an arithmetic progression 

l 
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and a geometric progression simultaneously. This suggests using Frciman's theorem 
[F1999] to obtain the estimate (1), but the best known quantitative bounds for 
Freiman's theorem [Cha2002] are only able to gain a logarithmic factor in \A\ over 
the trivial bound, as opposed to the polynomial gain of \A\ £ in our result. 

We do not know what the optimal value of e should be. If the finite field F were 
replaced with the integers Z, then it is an old conjecture of Erdos that one indeed 
has max(|j4 + A\, \ A-A\) > c(s)\A\ 2 ~ £ for any e > 0, and in analogy with this integer 
problem one might conjecture that max(|A + A|, \A- A\) > c(e) min(|v4| 2 ~ £ , If 1 ! 1-6 ) 
for all subsets A of F. However such an estimate, if true, is likely to be extremely 
difficult. In the integer problem, the analogue of (1) was obtained by Erdos and 
Szemeredi [ESzl983], with improvements in the value of e by various authors; at 
present the best known result is e = 1/4, obtained by Elekes [E11997]. Also, a 
continuous version of (1), for fractal subsets of the real line, was recently obtained 
by the first author [B]. 

The proof of Theorem 1.1 is based on a recent argument of Edgar and Miller 
[EdMi2003] , who solved the Erdos ring problem [EV1966]. (An alternate solution 
to this problem has also appeared in [B]). Specifically, these authors showed that 
there was no Borel subring A of the reals which has Hausdorff dimension strictly 
between and 1. Since such subrings clearly obey the identities A + A = A- A = A, 
one can see that this problem has some similarities with Theorem 1.1. We prove 
this theorem in Section 4. 

It has been known for some time that sum-product estimates have application to 
certain geometric combinatorics problems, such as the incidence problem for lines 
and the Erdos distance problem. (See e.g. [EU997], [Fal986], [ChuSzTrl992], 
[KT2001], [T.l]). Using these ideas (and particularly those from [KT2001], [T.l]), 
we can prove a theorem of Szemeredi- Trotter type in two-dimensional finite field ge- 
ometries. The precise statement is in Theorem 6.2; roughly speaking, this theorem 
asserts that if we are in the finite plane (Z/qZ) 2 and one has N lines and N points 
in that plane for some 1 < JV < g 2 , then there are at most 0{N 3 '/ 2 ~ e ) incidences; 
this improves upon the standard bound of 0(N 3 ^ 2 ) obtained from extremal graph 
theory. We state and prove this theorem in Section 6. Roughly speaking, the idea is 
to assume for contradiction that one can attain close to N z / 2 incidences, and then 
show that this forces most of the N points to lie in a (projective transformation of 
a) N 1 / 2 x N 1 / 2 grid. One then applies Theorem 1.1 to obtain a contradiction. Our 
arguments closely follow those in [KT2001], [T.l]. 

Using this Szemeredi- Trotter type theorem we can also obtain a non-trivial result 
on the finite field Erdos distance problem in the case when —1 is not a square: 
specifically, we show that N points in the finite plane (Z/qZ) 2 determine at least 
7V 1 / 2+£ distinct distances if 1 <C N <C q 2 ', this improves upon the bound of N 1 / 2 
obtainable by extremal graph theory. This result uses the Szemeredi- Trotter type 
theorem and the standard observation that the set of points equidistant from two 
fixed points p,p' all lie on a line (the perpendicular bisector of p and p'). As such 
the argument is similar to those in [ChuSzTrl992], [KT2001], [T.l], and is in fact 
rather short. We state and prove this theorem in Section 7. 
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As our final application, we give a new bound on Besicovitch sets in the three- 
dimensional finite geometry (Z/qZ) 3 . A Besicovitch set is a set which contains a 
line in every direction. The Kakeya conjecture for finite fields asserts that such sets 
have cardinality at least c(e)q 3 ~ e for each e > 0. The previous best lower bound 
known is eg 5 / 2 , and is due to Wolff [W1999] (see also [W1995], [MT]). We improve 
this to cq 5 / 2+E for some absolute constant e > 0. We prove this in Section 8, using 
some geometric ideas of the second author to transform the problem into a two- 
dimensional one, to which the Szemeredi- Trotter theorem can then be applied. An 
analogous result in the continuous geometry R 3 will appear by the second author 
elsewhere. (An earlier result of Katz, Laba, and Tao [KLT2000] also gives a similar 
result in the continuous case, but this result relies crucially on the fact that R has 
multiple scales, and so does not apply to the finite field problem). 

The third author is a Clay Prize Fellow and is supported by a grant from the 
Packard Foundation. 



2. Some results from arithmetic combinatorics 

In this section we recall some known facts about A + A, A - A, etc. Here F = Z/qZ 
is a finite field of prime order. 

We first recall the Cauchy-Davenport inequality 

|A + B|>min(|A| + |B|-l,|F|) (2) 

for any non-empty subsets A, B of F. If we are allowed to arbitrarily dilate one of 
the sets A, B then we can improve subtantially on this inequality: 

Lemma 2.1. Let A, B be finite non-empty subsets of a finite field F, and Ite 
F* := F — {0} denote the invertible elements of F. Then there exists (6F* such 
that 

\A + BZ\>wm(±\A\\B\,±\F\). (3) 



Proof We may assume without loss of generality that < ||-F|, since if 

| -A 1 1 .B | > 7}\F\ we may remove some elements from A and B without affecting the 
right-hand side of (3). Let £ be an element of F*. We use the inclusion-exclusion 
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principle 1 and the invertibility of £ to compute 
\A + B£\ = \ |J a + B£\ 



a€A 



a£ A a,a' GA:a^a' 

a£A a,a'eA:a^a' 6,b'GB 



= I^II B |-7J E XI ^,(o-o')/(6-6')' 

a.a'eA^a' b,b'eB:b^b> 

where 5ij is the Kronecker delta function. If we average this over all ^ £ F, we 
obtain 

i^E 1^1^11*1-5 E E ^ 

1 1 ££F, a,a'eA:a^a' b,b'eB:b^b' 1 1 

1 \A\ 2 \B\ 2 



>\A\\B\-- 



2 LF - 1 



>\\A\\B\ 



by our hypothesis < \\F\. The claim (3) then follows by the pigeonhole 

principle. ■ 



We now recall the following sumset estimates (see e.g. [R1996], [N1996]): 

Lemma 2.2 (Sumset estimates). Let A,B be a non-empty finite subsets of an 
additive group such that \A + B\ < ifmin(|^4|, \B\). Then we have 

\A±A±A...±A\< CK C \A\ 

for any additive combination of A, where the constants C depend on the length of 
this additive combination. 



Next, we recall Gowers' quantitative formulation [G1998] of the Balog-Szemeredi 
lemma [BaSzl994]: 

Theorem 2.3. [G1998], [B1999] Let A,B be finite subsets of an additive group 
with cardinality \A\ = \B\, and let G be a subset of A x B with cardinality 

\G\>\A\\B\/K 

such that we have the bound 

\{a + b: (a,b) G G}\ < K\A\. 



To verify our use of the principle, suppose an element x lies in N of the sets a+B£ for some AT > 
1. Then the sum J2 a eA \ a + B £\ counts x N times, while the sum a ,a> eA-.a^a' \(a+B()n(a' '+B£)\ 
counts x N(N — 1) times. Since N — N 2 — ' s arwa ys less than or equal to 1, the claim follows. 
An alternate way to obtain this lemma (which gives slightly worse bounds when |j4||I?| <C |.F|, 
but somewhat better bounds when S> \F\) is by using the Cauchy-Schwarz inequality 

\\XA * Xssllfi < \\XA * XB«IL 2 1-4 + B£\ and again randomizing over £. 
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Then there exists subsets A', B' of A and B respectively with \A'\ > cK \A\, 
\B'\ > cK- c \B\ such that 

\A'-B'\ <CK C \A\. 

Indeed, we have the stronger statement that for every a 1 E A and b' E B, there are 
at least cK~ c \A\ 5 solutions to the problem 

a! -b' = (ai - b x ) - (a 2 - b 2 ) + (a 3 - b 3 ); oi, a 2 , a 3 E A; b 1 ,b 2 ,b 3 G B. 



Note that all the above additive theorems have multiplicative analogues for mul- 
tiplicative groups. In particular there are multiplicative analogues on F provided 
we eliminate the origin from F (though in our applications this single element is 
insignificant to our estimates). 

We now recall a lemma from [KT2001] (sec also [T.l], [B]): 
Lemma 2.4. [KT2001] Let A be a non-empty subset of F such that 

\A + A\,\A-A\ <K\A\. 
Then there is a subset A' of A with \A'\ > cK~ c \A\ such that 

\A' ■ A' - A' ■ A'\ <CK C \A'\. 



Proof We outline the argument from [KT2001] or [B]. We shall use X ;$ Y to 
denote the estimate X < CK C Y. Without loss of generality we may assume that 
| .A | ;§> 1 is large, and that ^ A (since removing from A does not significantly 
affect any of the hypotheses). 

We first observe from Theorem 2.3 that we can find subsets C, D of A with 
C|, \D\ « I A I such that every element in C — D has \A\ 5 representations of 
the form 

ai - a, 2 + a 3 - a 4 + a 5 - a 6 ; a\, . . . ,a 6 e A. 
Multiplying this by an arbitrary element oiA-A-A/A-A, we see that every element 
of (C - D) ■ A ■ A ■ A/ A ■ A has > \A\ 5 representations of the form 

h - 62 + 63 - h + h - b 6 ; 61, . . . ,b 6 e A ■ A ■ A ■ A/A ■ A. 

However, by the multiplicative form of Lemma 2.2, the set A ■ A ■ A ■ A/ A ■ A has 
cardinality w \A\. Thus by Fubini's theorem we have 

\{C-D) ■ A- A- A/A- A\ < \A\. (4) 

Now we refine C and D. Since |C|, \D\ « \A\ and \A ■ A\ « \A\, we have \CD\ w 
\C\,\D\, and hence by the multiplicative form of Theorem 2.3, we can find subsets 
C, D' of C, D with |C"|, \D'\ « \A\ such that every element in CD' has > \A\ 5 
representations in the form 

j ; Ci,C2,c 3 eC; d 1 ,d 2 ,d 3 eD. 

Now let c, c' G C" and d,d' £ D be arbitrary. By the pigeonhole principle there 
thus exist c 2 E C, d 2 E D such that we have ^ |A| 3 solutions to the problem 

cidic 3 rf 3 

cd= ; Ci,c 3 eC; d 1 ,d 3 ED. 

c 2 d 2 
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We can rewrite this as 

cd — c'd' = x\ — x 2 + x% + X4 

where 



Xi = 



X 2 = 



%3 



(ci - d')dxc 3 d 3 

c 2 d 2 
d'(c' - di)c 3 d 3 

C2d 2 

d'c'(c 3 - d 2 )d 3 

c 2 d 2 
d'c'd 2 (c 2 - d 3 ) 



c 2 d 2 

For fixed c, d, c' , d' , c 2l d 2 , it is easy to see that the map from (01,03,(^1,^3) to 
(xi, x 2 , x 3 , X4) is a bijcction. Since all the xj lie in (C — D) ■ A ■ A ■ A/ A ■ A, we 
thus have ^ \A\ 3 ways to represent cd — c'd' in the form x\ — x 2 + x 3 — X4, where 
xi,x 2 ,x 3 , X4 all lie in (C — D) ■ A ■ A ■ A/A ■ A. By (4) and Fubini's theorem we 
thus have 

\C'D' - C'D'\ < \A\. 

In particular we have |C"D'| J$ |^4| ^ |C"|, which by the multiplicative form of 
Lemma 2.2 implies \C'/D'\ w \C'\. By considering the fibers of the quotient map 
(x, y) — > x/y on C x D' and using the pigeonhole principle, we thus see that there 
must be a non-zero field element x such that \C (~l Z?'a;| w \A\. If we then set 
A' := C" n D'a; we have \A'A' - A'A'\ < \A\ as desired. ■ 

In the next section we bootstrap the A ■ A — A ■ A type bound in Lemma 2.4 to 
other polynomial expressions of A. 



3. Iterated sum and product set estimates 



We now prove the following lemma, which is in the spirit of Lemma 2.2: 

Lemma 3.1. Let A be a non-empty subset of a finite field F, and suppose that we 
have the bound 

\A.A — A.A\ <K\A\ 

for some K > 1. We adopt the normalization that 1 € A. Then for any polynomial 
P of several variables and integer coefficients, we have 

\P(A,A,... ,A)\ <CK C \A\ 

where the constants C depend of course on P. 



Proof We need some notation. We say that a set A is essentially contained in B, 
and write A <s B, if we have A C X + B for some set X of cardinality \X\ < CK C . 

We have the following simple lemma of Ruzsa [R1999]: 

Lemma 3.2. Let A and B be subsets of F such that \A+B\ < CK C \A\ or \A-B\ < 
CK C \A\. Then B m A- A. 
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Proof By symmetry we may assume that |A + _B| < CK C \A\. Let X be a maximal 
subset of B with the property that the sets {x + A : x G X} are all disjoint. Since 
the sets x + A all have cardinality \A\ and are all contained in A + B, we see from 
disjointncss that \X\\A\ < \A + B\, and hence \X\ < CK C . Since the set X is 
maximal, we see that for every b S B, the set b + A must intersect x + A for some 
x e X. Thus b e x + A - A, and hence BCX + A-Aas desired. ■ 

Call an element x G F good if we have x ■ A <s= A — A. 
Proposition 3.3. The following three statements are true. 

• Every element of A is good. 

• If x and y are good, then x + y and x — y is good. 

• If x and y are good, then xy is good. 

(Of course, the implicit constants in "good" vary at each occurence). 

Proof Let us first show that every element of A is good. Since 1 £ A, we have 

\A.A-A\ < \A.A - A.A\ < K\A\ 
and hence by Lemma 3.2 

A.A A - A (5) 
which implies in particular that every element of x is good. 

Now suppose that x and y are good, thus x ■ A (s A — A and y ■ A <s A — A. Then 

(x + y)-AQx-A + y-A<^A-A + A-A. 

On the other hand, since \A — A\ < \A.A — A.A\ < K\A\, we have from sumset 
estimates (Lemma 2.2) that 

\A-A + A-A + A\< CK C \A\ 

and hence by Lemma 3.2 

A- A + A- A<& A- A. (6) 

Thus by transitivity of <s we have (x + y) ■ A g A — A and hence x + y is good. A 
similar argument shows that x — y is good. 

Now we need to show that xy is good. Since x ■ A g A — A we have 

xy ■ A <s y ■ A — y ■ A. 
But since y ■ A <e A — A, we have 

xy-A<eA-A-A + A. 
By (6) we conclude that xy is good. ■ 
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By iterating this proposition we see that for any integer polynomial P, every element 
of P(A, . . . , A) is good 2 . 

Write A 2 := A ■ A, A 3 := A ■ A ■ A, etc. We now claim inductively that A k <g A — A 
for all k = 0, 1, 2, 3, . . . . The case k = 0, 1 are trivial, and k = 2 has already been 
covered by (5). Now suppose inductively that k > 2, and that we have already 
proven that A k ~ x gi-A Thus 

for some set X of cardinality |X| < CK C ' . Clearly we may restrict X to the set 
j\k-i — (^4 — ^4). j n particular, every element of X is good. We now multiply by A 
to obtain 

A k CX-A + A- A- A-A. 

Since every element of X is good, and \X\ < CK C , we see that X ■ A <£ A — A. By 
(5) we thus have 

A fe <e A- A + A- A- (A- A). 
But by arguing as in the proof of (6) we have 

A-A + A-A-(A-A)eA-A, 
and thus we can close the induction. 

Since A k g A - A for every fc, and yl-A±(A-A)(EA- J 4by (6), we thus see 
that every integer combination of A k is essentially contained in A — A. In particular 
P(A, ... , A) <s= A — A for every integer polynomial A, and the claim follows. ■ 



4. Proof of the sum-product estimate 



We now have all the machinery needed to prove Theorem 1.1. We basically follow 
the Edgar-Miller approach, see [EdMi2003] . We write F for Z/qZ, and let F* := 
F — {0} be the invertible elements of F. Let 5 > 0, and let A be a subset of F such 
that \F\ S < \A\ < Iff-- 5 . 

Let < £ < 1 be a small number depending on 5 to be chosen later. In this section 
we use X < Y to denote the estimate X < C(5, e)Y for some C(S, e) > 0. Suppose 
for contradiction that 

\A + A\,\A-A\<\A\ 1+ t; 

Then by Lemma 2.4, and passing to a refinement of A if necessary, we may assume 
that 

\A- A- A- A\ < \A\ 1+Ce . 
We may normalize lei. By Lemma 3.1 we thus have 

\P(A,... ,A)\<\A\ 1+C ^ (7) 



An alternate way to proceed at this point is to show that the number of good points is at most 
< N; indeed, it is easy to show that any good point is contained inside (A — A + A — A) /(A — A) 
if N is sufficiently large, where we exclude from the denominator A — A of course. We omit the 
details. 
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for any polynomial P with integer coefficients, where the constants C depend of 
course on P. 

Our first objective is to obtain a linear surjection from A k to F for sufficiently large 
k: 

Lemma 4.1. There exists a positive integer k <~ 1/5, and invertible field elements 
£i, . . . , £fc e -F*, sucA fAaf 

In oifter words, we have a linear surjection from A k to F. 



Proof Iterating Lemma 2.1 about 0(1/ 8) times, we obtain £i, . . . ,£ k ^ F* such 
that 

\F\ 

i^i+---+^fci > y. 

The lemma then obtains after 0(f) applications of the Cauchy-Davenport inequality 
(2), increasing fe as necessary. ■ 

Next, we reduce the rank k of this surjection, at the cost of replacing A by a 
polynomial expression of A. 

Lemma 4.2. Let B be a non-empty subset of F, and suppose k > f is such that 
there is a linear surjection from B k to F . Then there is a linear surjection from 
B k - X to F, where B := B ■ (B - B) + B ■ (B - B). 



Proof By hypothesis, we have a surjection 

B k ->F : (ai,... ,o fc ) ^^ajtj 

j<k 

for some £i, . . . ,£ k G F. Our map cannot be one-to-one, since otherwise 

\B\ k = \F\ (contradicting primarily of \F\). 
Thus there arc (&i, . . . , b k ) ^ (b[, . . . b' k ) e B k with 

(h - + •■■ + (&*- b' k )^ k = 0. (8) 
Let bk 7^ b' k . By the surjection property 

F = BS 1 + ... + Bfa 
since F is a field, we thus have 

F = B^(b k -b' k ) + --- + B^ k (b k -b' k ) 
and substituting (bk — b' k )^ k from (8) 

F = B^(b k -bf k ) + ... + B^ k -i(b k - b' k ) - B(h - B(b k -i - 

C B& + ■ ■ ■ + Hk-i 

and the claim follows. ■ 
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Starting with Lemma 4.1 and then iterating Lemma 4.2 k times, we eventually get 
a linear surjection from a polynomial expression P(A, ... , A) of A to F, and thus 

\P(A,...,A)\>\F\. 

But this contradicts (7), if e is sufficiently small depending on S. This contradiction 
proves Theorem 1.1. ■ 



Remark. Suppose the finite field F did not have prime order. Then the analogue 
of Theorem 1.1 fails, since one can take A to be a subfield G of F, or a large subset 
of such a subfield G. It turns out that one can adapt the above argument to show 
that these are in fact the only ways in which Theorem 1.1 can fail (up to dilations, 
of course): 

Theorem 4.3. Let A be a subset of a finite field F such that \A\ > \F\ 5 for some 
< 5 < 1, and suppose that \A + A\, \A ■ A\ < K\A\ for some K > 1. Then there 
exists a subfield G of F of cardinality \G\ < K C ^\A\, a non-zero field element 
£ £ F - {0}, and a set X C F of cardinality \X\ < K c( - 5 ~> such that AC^GUX. 



It is interesting to compare the above theorem to Frciman's theorem ([F1999], 
[R1996], [Cha2002]) which does not assume control on \A- A\ but has a dependence 
on constants which is significantly worse than polynomial. It seems possible that 
the constant C(S) can be made independent of S, but we do not know how to do 
so. 

Proof (Sketch) Of course, we may assume that \F\ > K c{ ^ for some large C(d). 
We repeat the argument used to prove Theorem 1.1. This argument allows us to 
find a refinement A' of A with \A'\ > K~ c A such that \A' ■ A' - A' ■ A'\ < K C \A\. 
By dilating A and A' if necessary we may assume as before that 1 E A' (as we shall 
see, this normalization allows us to take £ = 1 in the conclusion of this theorem). 
By Lemma 3.1 we thus have \P(A' , . . . ,A')\ < K C \A'\ for all integer polynomials 
P, with the constant C depending on P of course. We may assume £ A' since 
adding to A' and A do not significantly affect the above polynomial bounds. 

We now claim that A' is contained in some subfield G of F of cardinality \G\ < 
K C ^\A\. The argument in Lemma 4.1 still gives a surjection from (A') k to F 
for some k ~ 1/5. We then attempt to use Lemma 4.2 to drop the rank of this 
surjection down to 1. If we can reduce the rank all the way to one, then we have 
by arguing as before that \F\ < K C ^\A\, so the claim follows by setting G := F. 
The only time we run into difficulty in this iteration is if we discover a linear 
surjection from some A k to F with k! > 1 which is also injective, where A is some 
polynomial expression of P(A', . . . , A'). An inspection of the proof of Lemma 4.2, 
combined with the normalizations 0, 1 € A', reveals that A must contain A'. If 
we have \A + A\ > \A\, then the linear map from (A + A) k — > F is surjective but 
not injective, which allows us to continue the iteration of Lemma 4.2. Similarly if 
\A ■ A\ > \A\. Thus the only remaining case is when \A\ = \A + A\ = \A ■ A\. But 
this, combined with the fact that 0, 1 € A, implies that A = A + A = A-A, and 
hence that A is a subfield of F. Since \A\ < K C ^\A'\, the claim follows. 
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This shows that A' is a subset of G. Since \A + A'\ < K\A\, we see from Lemma 
3.2 that A <s A' — A' , and hence A^G. Thus there exists a set Y of cardinality 
\Y\ < K c ^ such that ACG + Y. 

Let y G Y — G. To finish the proof (with £ = 1) it will suffice to show that 
| .A ("I (G + y)\ < for all such y. But observe that for any two distinct x, x' £ 

G + y, the sets xG and x'G do not intersect except at the origin (for if xg = x'g' , 
then g ^ g' , and hence x = (x' — x )-^^r E G, contradicting the hypotheses that 
x G G + y and y £ G). In particular, the sets x(A' — {0}) and x'(A' — {0}) are 
disjoint. Thus 

K\A\ > \A ■ A\ > \A n (G + - {0}| > \A n (G + y)|if- c |A| 

and the claim follows. 



5. Some basic combinatorics 



In later sections we shall use the sum- product estimate in Theorem 1.1 to various 
combinatorial problems in finite geometries. In doing so we will repeatedly use a 
number of basic combinatorial tools, which we collect here for reference. 

We shall frequently use the following elementary observation: If B is a finite set, 
and /j, : B — > R + is a function such that 

5>(6)>X, 

6GB 

then we have 



E V(b) > X/2. 

beB:fj,(b)>X/2\B\ 

We refer to this as a "popularity" argument, since we are restricting B to the values 
b which are "popular" in the sense that \i is large. 

We shall frequently use the following version of the Cauchy-Schwarz inequality. 

Lemma 5.1. Let A, B be finite sets, and let ~ be a relation connecting pairs 
(a,b) E A x B such that 

\{(a,b)eAxB:a~b}\>X 

for some X > \B\. Then 

X 2 

\{(a, a' ,6) e A x A x B : a ^ a'; a, a' - b}\ > —. 

\B\ 
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Proof Define for each b E B, define := \{a € A : a ~ b}\. Then by hypothesis 
we have 

E 

In particular, by the popularity argument we have 

E M&)>*- 

b€B:^(b)>X/\B\ 

By hypothesis, we have ^/|£?| ^ 1. From this and the previous, we obtain 

E M&)(M&)-i)>*W|B|) 

fceB: M (fc)>x/|B| 

and the claim follows. ■ 

A typical application of the above Lemma is the standard incidence bound on lines 
in a plane F 2 , where F is a finite field. 

Corollary 5.2. Let F 2 be a finite plane. For an arbitrarily collection P C F 2 of 
points and L of lines in F 2 , we have 

\{( P ,l)ePxL:pel}\<\P\ 1/2 \L\ + \P\ (9) 

Proof We may of course assume that the left-hand side of (9) is ^> \P\, since the 
claim is trivial otherwise. From Lemma 5.1 we have 

\{{p,l,l') EPxLxL-.p €lnl';l^l'}\ > | J P| _1 |{(p, Z) ePxL:pE l}\ 2 . 

On the other hand, \l D l'\ has cardinality O(l) if / ^ I', thus 

\{(p, l,l / )ePxLxL:pelf)l , ;l^ l'}\ < \L\ 2 . 

Combining the two estimates we obtain the result. ■ 

6. A SZEMEREDI-TROTTER TYPE THEOREM IN FINITE FIELDS 

We now use the one-dimensional sum-product estimate to obtain a key two-dimensional 
estimate, namely an incidence bound of Szemeredi- Trotter type. 

Let F be a finite field, and consider the projective finite plane PF 3 , which is the 
set F 3 — {(0,0,0)} quotiented by dilations. We embed the ordinary plane F 2 into 
PF 3 by identifying (x,y) with the equivalence class of (x,y,l); PF 3 is thus F 2 
union the line at infinity. Let 1 < N < \F\ 2 be an integer, and let P be a collection 
of points and L be a collection of lines in F 2 . We consider the problem of obtaining 
an upper bound on the number of incidences 

\{(p,l)€PxL:p€l}\. 

From Corollary 5.2 and the duality between points and lines in two dimensions we 
have the easy bounds 

\{(p,l) ePxL:pel}\< mm(\P\\L\^ 2 + \L\, \L\\P\^ 2 + \P\), (10) 



SUM-PRODUCT ESTIMATE 



13 



see e.g. [Boll978]. In a sense, this is sharp: if we set TV = \F\ 2 , and let P be all 
the points in F 2 C PF 3 and L be most of the lines in F 2 , then we have roughly 
\F\ 3 ~ N 3/>2 incidences. More generally if G is any subfield of F then one can 
construct a similar example with N — \G\ 2 , P being all the points in G 2 , and L 
being the lines with slope and intercept in G. 

Recently Elekes [Ell 997] observed that there is a connection between this incidence 
problem and the sum-product problem: 

Lemma 6.1. [E11997] Let A be a subset of F. Then there is a collection of points 
P and lines L with \P\ = \A + A\\A ■ A\ and \L\ = \A\ 2 which has at least \A\ 3 
incidences. 

Proof Take P = (A + A) x (A ■ A), and let L be the set of all lines of the form 
l(a, b) := {(x, y) : y = b(x — a)} where a, b are any two elements of A. The claim 
follows since (a + c, be) G P is incident to l(a, b) whenever a, b, c G A. ■ 

Thus any improvement to the trivial bound of 0{N 3 / 2 ) on the incidence problem 
should imply a sum-product estimates. Conversely, we can use the sum-product 
estimate (Theorem 1.1) to obtain a non-trivial incidence bound: 

Theorem 6.2. Let F be the finite field F :— Z/qZ for some prime q, and let P 
and L be points and lines in PF 3 with cardinality \P\, \L\ < N = \F\ a for some 
< a < 2. Then we have 

\{(p, l)ePxL:pel}\< CN 3 ' 2 - £ 

for some e = e(a) > depending only on the exponent a. 

Remark. The corresponding statement for N points and N lines in the Euclidean 
plane R 2 (or PR 3 ) is due to Szemeredi and Trotter [SzTrl983], with e := 1/6. This 
bound is sharp. It may be that one could similarly take e = 1/6 in the finite field 
case when a is sufficiently small, but we do not know how to do so; certainly the 
argument in [SzTrl983] relies crucially on the ordering properties of R and so does 
not carry over to finite fields. 

Proof We may assume that TV 3> 1 is large. By adding dummy points and lines 
we may assume that \P\ = \L\ = N. 

Fix N — \F\ a , and let < e <C 1, be chosen later. Suppose for contradiction that 
we can find points P and lines L with \P\ = \L\ = N such that 

\{( Pl l)ePxL:pel}\>N 3 ^; 

we shall use the sum-product estimates to obtain a contradiction if e is sufficiently 
small. Our arguments follow those in [KT2001], [T.Tj. 

We first use the popularity argument to control how many points are incident to a 
line and vice versa. For each peP, define the multiplicity /z(p) at p by 

/i(p) := \{l E L : p e l}\. 
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Then by hypothesis 

pGP 

and hence by the popularity argument and the hypothesis |P| = N 

peP-.Kp)^ 1 / 2 -- 
On the other hand, we observe that 

£ Mb) 2 « A^ 7 ^ £ MpXMp) - 1) 

peP:^(p)>A rl /2 + e pgP 

< iV- 1 / 2 - 5 £ |{(Z, l')€LxL:pel,l';l^ l'}\ 

pGP 

= N -i/2-e J- \{peP: P el,l'}\ 

Ll>eL:l=£l> 

< 7Y 1 / 2 + £ _ 

Thus if we set P' C P to be the set of all points p in P such that 

N 1/2 - e < fi(p) < N l ' 2+ " 

then we have 

E mp) > ^ 3/2 - £ - 

For each I E L, define the multiplicity by 

A(/) :=|{ P eP':pe;}|, 

then we can rewrite the previous as 

£a(0>jv 3/2 - 

ze£ 

By the popularity argument we thus have 

E A(J) > N^-t. 
;ei:A(z)>zv 1 /2- e 

On the other hand, we have 

E A(0<JV-V2-^A(0(A(0-l) 

;eL:A(;)»zv 1 /2+ e zgl 

< TV- 1 / 2 - £ | {(p> p') eP' xP> :p,p> El; P ^ p'}\ 

lEL 

= N -i/2-e J- \{lEL:p,p> El}\ 

p,p'eP':p=jtp' 

< n- 1 ' 2 -" e L 

p,p'£P':p=ip' 
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Thus if wc set V C L to be the set of all lines I in L such that 

N 1/2 - e < A(p) < 

then we have 

E A « £ ^ 3/2 ~ e - 

For each p E P' , let //(p) denote the multiplicity 

:=|{/eL':pe/}|; 
clearly fJ,'(p) < n{p)- We can then rewrite the previous estimate as 

Ec'(p)>^ £ . 

Thus by the popularity argument, if we set P" C P' to be the set of all points p in 
P' such that 

m'(p) £ ^ 1/2 ~ £ 

then we have 

E M'b)>iV 3/2 - e . 

pGP" 

or equivalently 

|{( P ,0eP"xL':peZ}|>C o iV 3/2 - e . 
Since \L'\ < N, we have in particular that 

|p"|>iVV2-e. (n) 

The next step is to capture a large portion of the popular point set P' inside a 
Cartesian product A x B, possibly after a projective transformation. The key ob- 
servation is that such a product arises, modulo projective transformations, whenever 
one intersects two "bushes" of lines. 

Let po be any point in P". Then by construction there are > jVV2-e lines I m L 
containing p . Each of these lines I contains > TV 1 / 2-6 points p in P 1 ; of course, all 
but one of these are distinct from po. Thus we have 

\{(p,l) EP'xL': p, Po E l,p^ Po }\ > N 1 - 2 ". 

Let us define a relation ~ on P by defining p ~ p' if p ^ p' and there is a line in V 
containing both p and p' . Since two distinct points determine at most one line, we 
thus have 

|{peF':p~ Po }\ > N 1 - 26 for all p Q E P" . 
Summing this over all po in P" , we obtain 

|{(p ,p)eP"xP':p^p'}|>iV 1 - 2£ |P"|- 

Since |P'| < N, we thus see by Lemma 5.1 that 

\{(P0,P1,P) E P" xP"xP':p~ Po ,Pi;Po ± Pi}| > N^P'f . 
By the pigeonhole principle, there thus exist distinct points Po,Pi E P" such that 

\{ P EP' :p~ P0 ,Pi}\ ^N 1 -^. (12) 
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Fix these po,Pi- By applying a projective linear transformation (which maps lines 
to lines and preserves incidence) we may assume that po, p\ are both on the line 
at infinity. Indeed, we may assume that po = [(1,0,0)] and p\ — [(0,1,0)], where 
[(x, y, z)] is the equivalence class of (x, y, z) in PF 3 . 

We first eliminate those points p in (12) on the line at infinity. Such points can only 
occur if the line at infinity is in V . But then that line contains at most 0{N x / 2+e ) 
points in P' , by the definition of L'. Thus if e is sufficiently small we have 

\{p G P' n F 2 : p ~ [(1, 0, 0)], [(0, 1, 0)]}| > N 1 -^. 

Consider the lines in L' which pass through [(1,0,0)]. In the plane F 2 , these lines 
be horizontal, i.e. they are of the form {(x,y) G F 2 : y — b} for some b G F. Let 
B C F denote the set of all such b. Since each line contains at least cN x / 2 ~ £ points 
in P' , and \P'\ < N, we know that \B\ < A fl / 2+e . Similarly the lines in L' which 
pass through [(0, 1, 0)] must in F 2 be vertical lines of the form {(x, y) G F 2 : x = a} 
for a G A, where \A\ < CN 1 ' 2 ^. We thus have 

\P' D (A x B)\ > N 1 - 05 (13) 

and 

\A\,\B\ < CN 1 / 2 ^ (14) 

Now that we have placed P' in a Cartesian grid, the next step is to exploit the form 
y = mx + b of lines in F 2 to obtain some additive and multiplicative information 
on A and B. 

Define P := P' n (A x B). By definition of P' we have 



\{leL:pel}\> N 1 ' 2 - 6 for all p G P : 
summing over P using (13) and rearranging, we obtain 

\{(p,l)ePoxL:pel}\>N 3 / 2 - c ". 

Let Lq be those lines in L which are not horizontal. Since horizontal lines can 
contribute at most |Po| < N incidences to the above expression, we have (if e is 
sufficiently large) 

\{( P ,l)ePoxLo:pel}\>N 3 ^. 
By the popularity argument, if we let L\ denote those lines in L such that 

\{ P eP :pel}\>N^ 2 - Cs 

we thus have 

\{{p,l)eP xii : P el}\>N 3 / 2 - c " 
if the implicit constants are chosen appropriately. 

Define a relation ~ between B and L\ by defining b ~ Z if there is a point p in the 
row Po H (A x {6}) such that p £ I. Note that such a point p is unique since I is not 
horizontal, and thus 

|{(M)ePxi i: 6~Z}|>iV 3 / 2 - C£ . 
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By Lemma 5.1, wc thus have 

| {(6, b', I) G B x B x L x : b, b' ~ l}\ > N 2 ~ Ce . 

By (14) and the pigeonhole principle, we thus conclude that there exists distinct 
heights b,b' G B such that 

{{leL, :b,V~l}\>N 1 - Ce . 

Fix this b, b'. By an affine transformation of the vertical variable (which does not 
affect the line at infinity) we may assume that b = and b' = 1. Since each line 
Z G L\ contains > N 1 / 2 ^ 06 points (x,t) in P a , and hence in A x B, and most of 
these have t ^ 0, 1 since Z is not horizontal, we have 

|{(x, t, Z) G A x B x Li : 0, 1 ~ Z; (x,t) G l;t + 0, 1}| > N 3 ' 2 ~ Ce . 

By definition of the relation a ~ Z, we thus have 

|{(x,i,Z,x ,zi) eAxBxIixAxi: (x ,0), (x,t), (xi,l) G Z;i ^ 0, 1}| > N 3/2 ~ Ce . 

Since the three points (x ,0), (x,i), (xi, 1) determine Z, and 

x = x + (xi - x )i, 

we thus have 

\{(t,x ,xi) EBxAxA: (l-t)x +t Xl e4;i^0,l}| > N 3 ' 2 - Ce . 

(15) 

Note that this is somewhat similar to saying that (1 — B).A + B.A C A, so we are 
getting close to being able to apply our sum-product estimate. But first we must 
perform some Balog-Szemeredi type refinements. 

Let A' C A denote those xi in A for which 

\{(t,x ) eBxAxA: (l-t)x +tx 1 G A;t ^0,1}| > A^ 1_Ce . 
From (14), (15) and the popularity argument we have 

\{(t,x ,xi) EBxAxA': (l-t)x Q +tx 1 G A;t ^ 0, 1}| > N 3 / 2 ~ Ce 

(16) 

if the implicit constants are chosen correctly. 

In particular, from (14) again we have 

\A'\ > N 3 / 2 - C£ /\A\\B\ > N^ 2 - Cs . (17) 

Also, by (16), the pigeonhole principle and (14) we may find t G B such that 
t 7^ 0, 1 and 

|{(x , xO G A x A' : (1 - t )x + hxi G A}| > N 1 ' 2 ^ 
By (14) we have 

|{(xo,xx) G Ax A' : (1 - t )xo + hxi G A}\ > N~ Ce \A\\A'\. 
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By (17), (14) and Theorem 2.3 applied to the sets (1 — to)A and toA', we thus 

have a subsets (1 — to)A of (1 — t )A and t A" of t A' with cardinalities at least 
> N i/2-Ce such that 

|(1 -t )A + t A'\ < N^ 2+Ce . 
By (17), (14) and sumset estimates, this implies in particular that 

\t A' + t A'\ <N^ 2+Ce 

and hence 

\A' + A'\<N 1 / 2+C£ . (18) 

Now we return to (16). From (14) and the pigeonhole principle we may find an 
x G A such that 

\{(t, Xl ) G B x A' : (1 - t)x + txj. G A; t ^ 0, 1}| > N l ~ Ce . 

By a translation in the horizontal variables xo, x\, A, A' we may assume that 
x = 0. Thus 

\{(t, Xl ) G (B\{0» x (A'\{0» : tei G A}| > iV 1 ^, 

since the contribution of is easily controlled by (14). By (14) and the multiplicative 
form of Theorem 2.3, we can thus find a subset A" of A'\{0} with \A"\ > N 1 / 2 - Ce 
and 

\A" ■ A"\ < N 1 ' 2 ^. 
On the other hand, from (18) we have 

\A" + A"\<N 1 ' 2+Ce . 

But this gives a contradiction to the sum product estimate (Theorem 1.1) if e is 
sufficiently small. ■ 



Remark. One can extend this result to more general finite fields F using Theorem 
4.3 as a substitute for Theorem 1.1. Informally, the result is as follows: for general 
finite fields, one has the same conclusions as Theorem 6.2 except when P has large 
intersection with a projective transformation of a Cartesian product G x G for some 
subficld G of F, and when L has large intersection with the associated collection of 
lines. We omit the details. We also remark that similar generalizations can be made 
for the problems stated in the next few sections, but the generalizations become 
quite cumbersome to state and prove and we shall not do so here. 



7. Applications to the distance set problem 



We now work in the finite field plane F 2 . Given any two points (xi,y\), (£2,2/2), 
we define the distance d{{x\,y-\), (#2,1/2)) G F by 

d{{xi,y{), (x 2 ,y 2 )) = {x\ - x 2 ) 2 + {y\ - y 2 f 

(we omit the square root to avoid some distracting technicalities). Given any col- 
lection P of points in F 2 , we define the distance set A(P) C F by 

A(P) :={d(p,p') : P ,p'eP}. 



SUM-PRODUCT ESTIMATE 



19 



The Erdds distance problem is to obtain the best possible lower bound for |A(P)| 
in terms of |P|. If —1 is a square 3 , thus i 2 = —1 for some i £ F, then the set 
P := {(x,ix) : x G F} has A(P) = {0} even though |P| = |P|. To avoid this 
degenerate case we assume that —1 is not a square, then any two distinct points 
will have a non-zero distance. From the fact that any two "circles" intersect in 
at most two points, it is then possible to use extremal graph theory to obtain the 
bound 

|A(f)| > Cl^PI 1 ^; 

see also [E1946]. This bound is sharp if one takes P — F 2 , so that A(P) is essentially 
all of F. Similarly if one takes P = G 2 for any subficld G of F. However, as in 
the previous section one can hope to improve this bound when no subfields are 
available. 

From the obvious identity 

A(A x A) = (A - A) 2 + (A- A) 2 

it is clear that this problem has some connection to the sum-product estimate. In- 
deed, any improvement to the trivial bound on |A(P) can be used (in combination 
with Lemma 2.4 and Lemma 3.1) to obtain a bound of the form in Lemma 1.1. 
We now present the converse implication, using the sum-product bounds already 
obtained to derive a new bound on the distance problem. 

Theorem 7.1. Let F = Z/pZ for some prime p with p = 3 mod 4 (so — 1 is not 

a square), and let P be a subset of F 2 of cardinality \P\ = N = \F\ a for some 
< a < 2. Then we have 

|A(P)|>j\rV2-Hr 

for some e = e(a) > 0. 

Remark. In the Euclidean analogue to this problem, with N points in R 2 , it is 
conjectured [E1946] that the above estimate is true for all e < 1/2. Currently, this 
is known for all e < - \ w 0.364 [SoTaTo2001]. However, the Euclidean results 
depend (among other things) on crossing number technology and thus do not seem 
to obviously extend to the finite field case. 

Proof We shall exploit the Szemeredi- Trotter-type estimate in Theorem 6.2 in 
much the same way that the actual Szemeredi- Trotter theorem [SzTrl983] was 
exploited in [ChuSzTrl992] for the Euclidean version of Erdos's distance problem, 
or how a Furstenburg set estimate was used in [KT2001], [T.l] to imply a Falconer 
distance set problem result. The key geometric observation is that the set of points 
which are equidistant from two fixed points lie on a line (the perpendicular bisector 
of the two fixed points). 

We may assume that \F\ and |P| are large; in particular, we may assume that F 
has characteristic greater than 2. Fix N, and suppose for contradiction that 

|A(P)| ^N 1 ' 2 ^ 



'We thank Alex Iosevich for pointing out the necessity for —1 to not be a square. 
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for some small < e <C 1 to be chosen later. For any point p G P, we clearly have 
the identity 

{(p\ r) G P x A(P) : d(p,j/) = r}| = \P\ = N 

so by Lemma 5.1 

{(p',p",r) G P x P x A(P) : d(p,j/) - d(p,p") - r;j/ ^p"}\ > N 3 ' 2 ^ . 
We can of course eliminate the r variable: 

{(p',p") G P x P : d(p,p') = d(p,p");p' ± p"}\ > N 3 ^. 

Summing this over all p G P and rearranging, we obtain 

£ \{p G P : d(p,p') = d(p,p")}| > cN 5 ' 2 ^. 

p' ,p" £P:p'^p" 

By the pigeonhole principle, there thus exists p G P such that 

^ |{p G P : d(p,p') = d(p, j>o)}| > ciV 3 / 2 - £ . 

P'eP:p'#po 

By translation invariance we may take po = (0,0). Writing p' = (a,b) and p = 
(x, y), this becomes 

£ |{(x, y) G P : (x - a) 2 + (y - bf = x 2 + y 2 }\ > cN 3 ^. 

(a,b)eP:(a,b)jt(0,0) 

Thus if we let l(a, b) denote the perpendicular bisector of (0, 0) and (a, b): 

l(a,b) := {(x,y) G F 2 : (x-a) 2 + (y-b) 2 = x 2 +y 2 } = {(x,y) G F 2 : 2ax+2by = a 2 +b 2 } 

and let L be the collection of lines {l(a, b) : (a, b) G P\{0, 0}}, then we have 

{(p,l) ePxL:pel}> cN 3/2 - £ . 

But since all the lines l(a,b) are distinct, we have \L\ — N — 1, while |P| = N. 
Thus this clearly contradicts Theorem 6.2, and we are done. ■ 



8. Application to the three-dimensional Kakeya problem 

Let F := Z/qZ for some prime q. We now use the two-dimensional Szemeredi- 
Trotter theorem to obtain a three-dimensional estimate on Besicovitch sets. 

Definition A Besicovitch set P C F 3 is a set of points which contains a line in 
every direction. 

The Kakeya set conjecture for finite fields 4 asserts that for every Besicovitch set 
P, one has the estimate |P| > C £ \F\ 3 - £ for every e > 0; see [W1999], [MT] for 
further discussion on this conjecture. Previously, the best known lower bound was 
|P| £ |P| 5/2 , obtained in [W1999] (see also [W1995], [MT]). The purpose of this 
section is to improve this bound to |P| > |_F| 5 / 2 + £ for some absolute constant e > 0. 

This is the weakest of the Kakeya conjectures; and corresponds to the "Minkowski dimension" 
form of the Kakeya conjectures in Euclidean space. There is also a Hausdorff dimension analogue 
in finite fields, as well as a "maximal function" statement; see [MT] for further discussion. 
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In fact, we can prove a somewhat stronger statement. We say that a collection L 
of lines in F 3 obey the Wolff axiom if for every 2-plane n, the number of lines in I 
which lie in it is at most We will then show 

Theorem 8.1. Let L be a collection of lines in F 3 which obey the Wolff axiom and 
have cardinality \L\ ~ \F\ 2 . Let P be a collection of points in F 3 which contains 
every line in L. Then \P\ > |F| 5 / 2+e for some absolute constant e > 0. 

To see how this theorem implies the claimed bound on Besicovitch sets, observe 
that a collection of lines consisting of one line in each direction automatically obeys 
the Wolff axiom and has cardinality ~ | ^ | 2 . 

When e = this bound was obtained in [W1999], [MT]. It was observed in 
[KLT2000], [MT] that if one replaced the finite field Z/qZ by a finite field F which 
contained a subfield G of index 2, then one could obtain a "Hciscnbcrg group" 
counterexample which showed that Theorem 8.1 must fail for that field. Thus, as 
with the previous results, this theorem must somehow use the non-existence of non- 
trivial subfields of F. It is plausible that one could use the Euclidean analogue [B] 
of Theorem 1.1 to prove a similar result in Euclidean space (which would provide 
a completely different proof of the result in [KLT2000]), but we do not pursue this 
question here. 

Proof Let < e <C 1 be chosen later. Fix L, P, and assume for contradiction that 

\P\ < \F\ 5 ^. (19) 

As usual we first begin by running some popularity arguments. Fix L, P. For each 
point p G P define the multiplicity fi(p) by 

/i(p) :=\{l€L:pel}\; 
since every line contains exactly \F\ points, we thus have 

J2Kp) = \l\\f\~\f\ 3 . 

By (19) the popularity argument, if we thus set P' C P to be the set of points 
peP where 

m(p) > \F\ 1/2 -*, 

then we have 

E M*>) > \f\ 3 . 

For each line I G L define the multiplicity X(l) by 

A(0 := \{p € P' : p € l}\. 
Then we can rewrite the previous estimate as 

Thus if we set L' C L to be the set of lines I e L such that 

A(0 > \F\, 



22 JEAN BOURGAIN, NETS KATZ, AND TERENCE TAO 

then we have 

E A w>m 3 - 

lEL' 

Now define 

/x'(p) :=|{/eL':pe/}|, 

so that 

E ^'(p) £ i f i 3 - 

Thus by the popularity argument again, if we set P" C P' to be the set of points 
where 

m'(p) > l*1 1/2 - e , 

then 

E /*'(?) £ i^i 3 - 

pGP" 

Now define 

A'(Z) :=\{ P EP" :pel}\, 

so that 

E A '«^i 3 - 

iez/ 

If we set L" C L' to be set of lines Z such that 

A'(0 > \F\, 

then we have 

E A '« £ i f i 3 ' 

ZSL" 

and thus 

\{(pJ)EP"xL": P El}\>\F\ 3 . 
By Lemma 5.1 and (19) we thus have 

\{{p, I, I') G P" x L" xL" :pel, V; I ? l'}\ > \F\ 7/2 ~ e . 

Define a relation ~ on L" by defining Z ~ I' if Z ^ Z' and the lines Z and Z' intersect 
at a point in P". From the previous we thus have 

|{(Z,Z')GL"xL":Z^Z'}|>|F| 7 / 2 - £ , 

since two lines intersect in at most one point. Applying Lemma 5.1 again with the 
bound \L"\ = 0{\F\ 2 ) we obtain 

|{(Z,Z ,Zi) G L" x L" xL":I~ Z ,Z i; Z # ZJ| > |ff- C£ . 

By the pigeonhole principle we can thus find distinct Zo, h G £' such that 

\{leL" -.l-^hjl^lFl 1 -^. 

Fix Zo,Zi, and let C L" denote the set 

£-* := {Z* G L : Z* ~ Zo, h}, 

thus we have 

\L*\ > iFf-^. (20) 
Later on we shall complement this lower bound on L* with an upper bound. 
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The strategy of the proof will be to pass from L (which is in some sense a two- 
dimensional subset of a four-dimensional algebraic variety - a Grassmannian, in 
fact), to L' (which will essentially be a one-dimensional subset of a two-dimensional 
algebraic variety - namely, the set of lines intersecting both l and The latter 
situation is much closer to the incidence problem considered in Theorem 6.2, and 
we will be able to apply that theorem after some algebraic transformations and 
combinatorial estimates. 

For future reference we observe the following non-concentration property of the 
collection of lines L' (and hence of its subsets L" and L*). 

Lemma 8.2. Let I 1 , 1 2 , 1 3 be any non-intersecting lines in F 3 (not necessarily in 
L). Then 

\{l 6 L' : I 1 , 1 2 , 1 3 all intersect l}\ < \F\ 1 / 2+Ce . 

As a particular corollary of this lemma, we see that for each 2-plane tt there are at 
most 0(\F\^ 2+C£ ) lines in L' which lie in tt. One further consequence of this is that 
Iq and l\ are skew (otherwise all the lines in would lie on the plane generated 
by Iq and h, and (20) would contradict the above corollary). 

Proof This is a variant of some arguments in [T.2] and the second author. 

There are two cases: either some of the lines in I 1 , I 2 , I 3 are parallel, or they are all 
mutually skew. If two of the lines are parallel, ten all the lines I in the above set lie in 
a plane. If they are all skew, then it is well known that I lies in a quadratic surface 5 
(i.e. a set of the form {x e F 3 : Q(x) = 0} for some inhomogeneous quadratic 
polynomial Q) known as the regulus generated by I 1 , I 2 , I 3 ; see e.g. [Schl998], 
[T.2]. Thus in cither case, all the lines I of interest lie inside an algebraic surface S 
which is either a plane or a quadratic surface. It will then suffice to show that the 
set 

L s := {I G L' : I C S} 
has cardinality at most O^F^I 2 * *). 

We first observe that there are at most 0(|F|) lines in L which lie in S. When S is 
a plane this is just the Wolff axiom. When S is a quadratic surface this is simply 
because a quadratic surface contains at most 0(\F\) lines. In particular we have 
the crude bound \L S \ = 0(\F\). 

By definition of L' we have 

\{(p,l)e(P'nS)xL s :pel}\>\F\\Ls\. 
On the other hand, by Lemma 9 we have 

\{( P , l)e(P'nS)xL s : P e l}\ < \P' n S\^ 2 \L S \ + \P' n S\. 

Combining the two bounds, we obtain after some algebra 

\P'nS\>mm(\F\\L s \,\F\ 2 ) > \F\\L S \. 

5 A model example is when the lines V are of the form V = {(xj ,y,Xjy) : y £ _F} for some 
distinct xi,X2,x:j e F. Then all the lines I lie in the quadratic surface {(x,y,xy) : x, y e _F}. 
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By the definition of P', we thus have 

\{(p,l) G (P' n S) x L : p G 1}\>\F\ 3 ^ S \L S \. 

The line Z certainly intersects S, but it need not be contained in S. By the triangle 
inequality we have either 

\{(p, l)e(P'nS)xL:pel;lcS}\> |F| 3 / 2 - £ |L s | (21) 

or 

\{(p, l)€(P'nS)xL:p£l;l<£S}\> \F\^\L S \. (22) 

Suppose first that (22) holds. Since S is either a plane or a quadratic surface, and I 
is not contained in S, it is clear that I intersects S in at most two places. Thus the 
left-hand side of (22) is bounded by at most 2\L\ = 0(\F\ 2 ), and the desired bound 
\L S \ = 0(\F\ 1 ^ 2+Ce ) follows. Now suppose that (21) holds. But then there are at 
most 0(|F|) lines I which lie in S, and each of those lines I contains at most |F| 
points p. Thus the left-hand side of (21) is boudned by 0(|F| 2 ), and the desired 
bound \L S \ = 0(|F|V2+Ce) again follows. ■ 

Let Z* be any line in L*. Then Iq, l\ each intersect I* in exactly one point. Since h 
lies in L' , we see from the definition of L" that 

\{peP":peh;p?l ,h}\>\F\. 

But then by the definition of P", wc thus see that 

\{(p, l)eP"xL': P e U;p ? l , h;p€l;l? U}\ > \F\^ 2 ~ C ". 

Observe that p is uniquely determined by I in the above set. Define H(U) C L' to 
be the set of all / e V which intersect I* in a point distinct from where Iq or l\ 
intersects I*; in the terminology of [W1995], H(l*) is the hairbrush with stem 1*. 
Then the previous estimate implies that 

\H{1*)\ > |F| 3 / 2 - Ce (23) 

for all I* eL,. 

We now complement that lower bound with an upper bound. 
Lemma 8.3. For each U S £*, we have 

\H(l*)\ < |^| 3/2+C£ . 

Proof We use the "hairbrush" argument of Wolff [W1995], [W1999], using the 
formulation in [MT]. All the lines I in H(l*) intersect U but are not coincident to 
i*. Thus we have 

\{(p,l) EPx H{h) :pel;p# h}\ = \H(h)\(\F\ - 1) ~ I^OII^I- 
By (19) and Lemma 5.1 we thus have 

\{(p, 1, G P x H(Q x H{h) :pel,l';p^U;l^ l'}\ > |F^)| 2 |^l" 1/2 " £ - 

Now fix I G H(l*), and consider how many lines V could contribute to the above 
sum. From the various constraints on p, I, I', l* we see that i, I', I* form a triangle, 
and thus I' lies on the plane generated by I and I*. By the Wolff axiom there are 



SUM-PRODUCT ESTIMATE 



25 



thus at most 0(|.F|) choices for I' for any fixed Z. Since Z and I' clearly determine 
p, we thus have 

\{(p, I, l')ePx H(Q x H(L) :pel,l';p#L;l? l'}\ < \H(h)\\F\. 
Combining this with the previous bound we obtain the Lemma. ■ 

We now refine these hairbrushes slightly. First we count how many lines Z in H(l*) 
could intersect ?o- Such lines would lie in the plane generated by Z* and Z and so 
there are at most 0(|F|) of them by the Wolff axiom (or one could use Lemma 
8.2). Similarly there are at most 0(\F\) lines in H(U) which intersect l\. Thus if 
we define H(U) to be those lines in H(U) which do not intersect either Z or Zi, 
then we have from (23) that 

\H(Q\ > \F\ 3 ^ (24) 

From (24) we have 

|{(Z*,Z) 6l,xL':(e H(Q}\ > |F| 3 / 2 - Ce |L,|. 

By Lemma 5.1 (using the bound \L'\ ~ \F\ 2 ) we thus have 

e i* x x L' :le H(h),H(Q;h ^ K}\ > l^ 1 "^^! 2 . 

By the pigeonhole principle we may thus find an Z* 6 L* such that 

|{(/:,0 £L*xL':le H(U),H(Q;h + Q\ > l^ 1 "^*!. 

Fix Z*. If we write L' t := L* — {Z»}, we thus have 

\{(Cl)GLixH(Q:lGH(Q}\>\F\^\L,\. 

Let us eliminate some degenerate lines in . Consider first the contribution of the 
lines Z* which contain the point Z* H Z . Such lines also intersect l\ and thus must 
lie on the plane containing Z* and l\. The line Z must also lie in this plane. Since 
this plane contains at most 0(|F| 1//2+c,£ ) lines in V by the corollary to Lemma 8.2, 
we thus see that the total contribution of this case is at most \F\ 1+Cs , which is 
much smaller than |F| 1_Ce |L*| by (20). A similar argument deals with those lines 
Z* which contain the point Z*flZi. Thus if we define L" to be those lines in L'^ which 
do not contain either Z* n Zo or Z* n Zi, then we have 

|{(0) £ L'i x H(Q : Z e H(Q}\ > \F\ 1 - Ce \L.\. 

If we thus define the quantity 

MO == I0: e L'i : l e H(Q}\ 

then we have 

E MO > W^LA- 

From Lemma 8.3 and the popularity argument, if we thus set 
H' := {Z e H(Q : MO > \F\-^ 2 - Ce \L4} 

then we have 

EMO^I 1 -^*!- (25) 

leH' 
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From Lemma 8.2 we see that 

MO < |F| 1/2+C£ . 

Indeed, if one unravels all the definitions, we see that the lines which appear in 
the definition of lie in L' and also intersect the three disjoint lines I, lo, l\. 

From this bound and (25) we have a lower bound on H'\ 

\H'\ > \F\^ 2 -^\L4. (26) 
Comparing this with Lemma 8.3 we thus obtain an upper bound on L*: 

\L*\ < (27) 

We now perform an algebraic transformation to convert this three-dimensional 
problem into a two-dimensional problem. To avoid confusion we shall use the 
boldface font to denote two-dimensional quantities. 

Lemma 8.4. There is a map II from lines in L" to points H(l'*) in the plane F 2 , 
and a map A from lines I in H' to lines A(Z) in the plane F 2 , with the following 
properties: 

• II is injective on L". 

• For each line 1 in F 2 , the fiber A _1 (l) has cardinality at most |F| 1 / 2+Ce . 

• If G L" x H' is such that I E then the point n(^) lies on the 
line A(l). 

Let us assume this lemma for the moment, and conclude the proof of the Theorem. 
Define the two-dimensional set of points P := LT(L") and the two-dimensional set 
of lines L := A(H'). From (27) we have 

|P| < \L*\ < \F\ 1+C£ . (28) 

From the multiplicity of A and (26), (20) we have 

|L| > \F\-^ 2 - Ce \H'\ > |F|- C£ |L*| > IFI 1 ^. (29) 

Now for each / S H', we have 

\{K G L» : I G H(Q}\ > \F\-^ 2 - Cs \L4 > |F|V2-C, 

by (20) and the definitions of H' and n' . Applying the first and third parts of 
Lemma 8.4, we thus conclude that 

{peP:pel}|> IFI 1 / 2 -^ (30) 

for all 1 g L. 

Set N := |P|. From (28), (29) we can find a subset L' of L with 

\F\- Ce N < |L'| < N. 
Combining this with (30) and then (28) we obtain 

{(p, 1) g P x L' : p e 1}| > |F| 1/2 - Ce A^ > N 3/2 - Cs . 
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But this contradicts the Szemeredi- Trotter type estimate in Theorem 6.2 if e was 
chosen sufficiently small. This proves the theorem. 

It remains to verify Lemma 8.4. It is convenient to work in co-ordinates, and for 
this we shall first normalize the three lines lo, /*, l\. Recall that lo and l\ are skew, 
while l* intersects both Iq and l\. After an affine linear transformation, we may set 



lo 


:={(*, 0,0) 


xeF) 




:={(0,0,z) 


zeF} 


h 


:={(o,y,i) 


yeF} 



Consider a line l'^ G L". This must intersect the line lo in some point (x,0,0) with 
x ^ 0, and intersect the line l\ in some (0,y, 1) with some y ^ 0. Thus l'^ has the 
form 

li = {((l-t)x,ty,t):t€F}. 
We define the map II : L" -> F 2 by 

n(0:= (i/x,i/y). 

Clearly IT is injective (since two points determine a line). 

Now we consider a line I G H' . This line must intersect i* at some point (0,0, z) 
with z 0, 1. Thus I takes the form 

l = {(a(t-z),b(t-z),t):teF} (31) 

for some a, 6 G F; note that a, b ^ since Z is disjoint from l and l\. Suppose this 
line I intersects the line l[ mentioned earlier. Then we must have 

a(t — z) = (1 — t)x; b(t — z) = ty 

for some t G F. Using some algebra to eliminate t, we eventually end up with the 
constraint 

xy + (bz — b)x + yaz = 
or (dividing by the non-zero quantity xy) 

l + (bz- b)- + az- = 0. 

y x 

Thus if we set A(Z) to be the line 

A(Z) := {(A, Y) G F 2 : 1 + {bz - b)Y + azX = 0} 

(note that this is indeed a line since a, b ^ and z ^ 0,1), then we see that 
U(Q G A(Z) as desired. 

It remains to verify the second property of Lemma 8.4. We consider a line of the 
form {(A, Y) G F 2 : 1 + /3Y + aX = 0} for some a, f3 ^ (since these are the only 
lines in the image of A), and let L a ^ denote the inverse image of this line under A 
in H'; thus L a .p consists of all the lines in H' of the form (31), where bz — b = [3 
and az = a. Observe that all such lines must intersect U, and must also intersect 
the lines {(— a, y,0) : y G F} and {(x, — /?, 1) : x G F} (indeed, the intersection 
points are (— a, — bz,0) and (a(l — z),— (3, 1) respectively). These three lines are 
disjoint, so by Lemma 8.2 we have \L a jj\ < |i^| 1 / 2 + Ce as desired. This concludes 
the proof of Theorem 8.1. ■ 
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